GAN-Based Training of Semi-Interpretable Generators for Biological Data Interpolation and Augmentation

نویسندگان

چکیده

Single-cell measurements incorporate invaluable information regarding the state of each cell and its underlying regulatory mechanisms. The popularity use single-cell are constantly growing. Despite typically large number collected data, under-representation important (sub-)populations negatively affects down-stream analysis robustness. Therefore, enrichment biological datasets with samples that belong to a rare or manifold is overall advantageous. In this work, we train families generative models via minimization Rényi divergence resulting in an adversarial training framework. Apart from standard neural network-based models, propose semi-interpretable models. proposed further tailored generate realistic gene expression measurements, whose characteristics include zero-inflation sparsity, without need any data pre-processing. Explicit factors such as measurement time, cluster taken into account by our conditional variables. We compare them against state-of-the-art on range synthetic real demonstrate their ability accurately perform interpolation augmentation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning interpretable representations of biological data

The increasing ease of collecting genome-scale data has rapidly accelerated its use in all areas of biomedical science. Translating genome scale data in to testable hypothesis, on the other hand, is challenging and remains an active area method development. In this talk we present two machine learning approaches to deduce data representations that are inspired by a mechanistic understanding of ...

متن کامل

BAGAN: Data Augmentation with Balancing GAN

Image classification datasets are often imbalanced, characteristic that negatively affects the accuracy of deeplearning classifiers. In this work we propose balancing GANs (BAGANs) as an augmentation tool to restore balance in imbalanced datasets. This is challenging because the few minority-class images may not be enough to train a GAN. We overcome this issue by including during training all a...

متن کامل

Semi - supervised Learning Methods for Data Augmentation

The original goal of this project was to investigate the extent to which data augmentation schemes based on semi-supervised learning algorithms can improve classification accuracy in supervised learning problems. The objectives included determining the appropriate algorithms, customising them for the purposes of this project and providing their Matlab implementations. These algorithms were to b...

متن کامل

Synthetic Data Augmentation using GAN for Improved Liver Lesion Classification

In this paper, we present a data augmentation method that generates synthetic medical images using Generative Adversarial Networks (GANs). We propose a training scheme that first uses classical data augmentation to enlarge the training set and then further enlarges the data size and its diversity by applying GAN techniques for synthetic data augmentation. Our method is demonstrated on a limited...

متن کامل

THE USE OF SEMI INHERITED LU FACTORIZATION OF MATRICES IN INTERPOLATION OF DATA

The polynomial interpolation in one dimensional space R is an important method to approximate the functions. The Lagrange and Newton methods are two well known types of interpolations. In this work, we describe the semi inherited interpolation for approximating the values of a function. In this case, the interpolation matrix has the semi inherited LU factorization.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app12115434